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Abstract. A fundamental problem of non-equilibrium statistical mechanics is 
the derivation of macroscopic transport equations in the hydrodynamic limit. 
The rigorous study of such limits requires detailed information about rates of 
convergence to equilibrium for finite sized systems. In this paper we consider 
the finite lattice {1, 2, . . . , A^}, with an energy Xi G (0, oo) associated to each 
site. The energies evolve according to a Markov jump process with nearest 
neighbor interaction such that the total energy is preserved. 

We prove that for an entire class of such models the spectral gap of the 
generator of the Markov process scales as 0{N~^). Furthermore, we provide 
a complete classification of reversible stationary distributions of product type. 
We demonstrate that our results apply to models similar to the billiard lattice 
model considered in 1101 . and hence provide a first step in the derivation of a 
macroscopic heat equation for a microscopic stochastic evolution of mechanical 
origin. 

1. Introduction 

1.1. Motivation and related works. A fundamental problem of non-equilibrium 
statistical mechanics is the derivation of effective equations in the hydrodynamic 
limit. Often these are hydrodynamic equations (Euler, Navier-Stokes) , or related 
transport equations (Burgers equation, heat equation). There are very few models 
for which rigorous results exist. They include particle models like simple exclusion, 
zero range processes, see |15j and references therein, and continuous systems like 
the Ginzburg-Landau equation [71 [6j [11] and the model of jH] . 

The rigorous study of hydrodynamic limits requires detailed information about 
rates of convergence to equilibrium for finite sized systems, especially if the system is 
of non-gradient type. In particular, the scaling of the spectral gap of the generator 
with the system size N is of crucial importance. Such information is typically 
obtained by analyzing the Dirichlet form, corresponding to the explicitly known 
stationary distributions. 

Obtaining good estimates (in terms of the system size) on the spectral gap of the 
generator is highly non-trivial. For example, to obtain the corresponding results 
for the Kac model [13] it took almost half a century [12] (using Yau's martingale 
method [M [19] ) and [B [2] . 



Date: September 13, 2011. 

A.G. gratefully acknowledges the generous support and warm hospitality of the Fields Institute 
and the Mathematical Institute of the Budapest University of Technology and Economics. 

D. Sz. expresses his sincere thanks to Department of Mathematics of University of Toronto 
and to the Fields Institute, Toronto where part of this work has been done (Fall term 2009 and 
June 2011, resp.), and is grateful to Hungarian National Foundation for Scientific Research grants 
No. T 046187, K 71693, NK 63066 and TS 049835. 

1 



2 ALEXANDER GRIGO, KONSTANTIN KHANIN, AND DOMOKOS SZASZ 

Recently there has been a growing interest in and hope for estabhshing hydrody- 
namic hmits for systems that are either purely deterministic or originate (somehow) 
from deterministic, in particular mechanical, models. A program to obtain informa- 
tion about the stationary distributions under the influence of stochastic boundary 
conditions was proposed in [5]. Another approach was suggested in the recent se- 
ries of papers 'W, 'TO , where the analysis of the hydrodynamic limit of a billiards 
lattice model was outlined by following a two-step procedure. In the first step the 
deterministic dynamics is rescaled in order to obtain a mesoscopic stochastic model 
(also referred to as master equation). In a second step the hydrodynamic behavior 
of the mesoscopic stochastic model should be derived. 

For neither of the two steps proposed in |8l [10] rigorous results are available. 
Deriving master equations from interacting mechanical models is a very difficult 
problem. Only recently some rigorous results in this direction were obtained in 
[1], where the weak interaction limit is considered opposed to the rare interaction 
limit of fS', To'. As a matter of fact, the second step, i.e. deriving the hydrody- 
namic limit from the master equation, seems a much more tractable mathematical 
problem. The present paper is an attempt to make a first step in this direction 
by providing information about the spectral gap of the generator of an entire class 
of models, which are of similar type as the master equation of the billiard lattice 
model considered in [51 [TU]. In particular, the model 10 belongs to the class of 
models we are considering, the obtained spectral bound is exactly the necessary 
one for which the derivation of the hydrodynamics limit is feasible. 

1.2. Description of the model. The model we consider in this paper is as follows. 
Let iV > 2 be an integer, and consider the lattice {1, 2, . . . , A^}. To every site i 
of this lattice we associate an energy Xi, which is a positive real number. The 
collection of all the energies will be denoted hy x = {xi, . . . jXn) G K+. To each 
nearest neighbor pair of the lattice we associate an independent exponential clock 
with a rate A that depends on the total energy of this pair. As soon as one of the 
A'' — 1 clocks rings, say for the pair + 1), then a number < a < 1 is drawn, 
independently of everything else, according to a distribution P, that only depends 
on the two energies Xi, Xi+i. The update of the energies is then such that the new 
energy at site i is a (xi + Xi+i), the new energy at site i -I- 1 is (1 — a) (xi -|- Xi+i), 
and all other energies remain unchanged. 

This procedure defines a continuous time Markov jump process X(t) on R^. 
More formally, we define the process X(i) by its infinitesimal generator £, acting 
on bounded functions A : ^ M as 

JV-l 

(1) £A{x) ^ ^A(xi,Xi+i) / P{xi,Xi+i,da)[A{Ti^ax) - A{x)] 

i=i •' 

where A: — )• R_|_ is continuous, and P{xi,Xi+i,da) is a probability measure on 
[0,1], which depends continuously on (xi,Xi+i) G R^. The maps Ti ^ model the 
energy exchange between the neighboring sites i and i -f 1, and are defined by 

(2) Ti^aix) = x+ [axi+i - (1 - a)x.,] [e^ - e^+i] 
where e^ denotes the i-th unit vector of R^. 



Throughout this paper we will always assume that the various functions are Borel measurable 
without stating this assumption explicitly. This will not lead to confusion, since higher regularity 
assumptions (like continuity or integrability) are stated explicitly. 
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In particular, the process X(f) preserves the total energy, i.e. for any two times 
ti and t2 the identity X]i=i^i(^i) = Si=i^j(*2) holds. Therefore, we introduce 
for any e > the sets 

N ^ 

Se,N = e M^' : ^ - = e} 

which are invariant for the process X(i). The value of e represents the mean energy 
per site. 

Since S^^pf is compact and invariant the assumed continuity of A and P guaran- 
tees the existence of at least one stationary distribution tt^jv for X{t) on each iSg^jv- 
As we pointed out, the scaling of the rate of convergence towards the stationary 
distribution in terms of the lattice size N is of crucial importance in studying the 
hydrodynamic limit of this model rigorously. 

1.3. Outline of the paper. The purpose of this paper is to present a dynamical 
and geometric approach to establish the scaling of the spectral gap of the generator 
(HJ under rather general assumptions on the rate function A and transition kernel 
P. The strategy we adopt is as follows. In Section [3] we show that for a large 
class of rates A and transition operators P the scaling of the spectral gap of the 
corresponding generator ([T]) can be obtained by considering only the special case of a 
constant rate A and a state independent transition kernel P. The precise statement 
is formulated in Theorem 13. 11 which we prove under the two key assumptions: the 
reversibility of the process X(t), and the existence of a lower bound on the rate 
function A. The requirement of a lower bound on the rate function seems to be a 
technical condition, but it cannot be removed at present. 

In Section [S] we show that (a slight modification of) the three-dimensional sto- 
chastic billiard lattice model of pTO] is a special case of the general model considered 
in the present paper, provided that one introduces a lower cut-off for the rate func- 
tion originally considered in [1^. In particular, we show that it then follows that 
the spectral gap scales as 0{N~'^). 

Since we assume reversibility of the stationary distribution to derive the spectral 
properties, we provide in Section H] a classification of reversible stationary distribu- 
tions of product type. Such measures are of particular interest in the hydrodynamic 
limit, and appear naturally in mechanical models and statistical mechanics in form 
of Gibbs measures. We show in Theorem 14.31 that if a model of the class Q consid- 
ered in this paper admits a reversible product distributions, then this measure must 
necessarily be a product Gamma-distributions (or a single atom) . This is precisely 
the type of product measures considered in statistical mechanics for mechanical 
models. 

The main part of the paper deals with establishing the scaling of the spectral gap 
of the generator for the process with constant rates A and state independent tran- 
sition kernel P. This case is studied in Section[2] The key difference of our analysis 
when compared to the above mentioned related works is that instead of focusing 
directly on convergence, for example by analyzing the associated Dirichlet form, 
we first establish weak convergence towards a stationary distribution. For the later 



The parameter e denotes the average energy per site and should not be thought of as a 
necessarily small number. We hope that this does not cause any confusion, even though it is a 
common practice to reserve the use of the symbol e to denote a small number. 
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part it is crucial that this weak convergence is made quantitative in a sufficiently 
strong metric for the weak topology. For this purpose we use the Vaserstein dis- 
tance and prove in Theorem 12.91 that there is an exponential rate of convergence of 
X{t) to equilibrium, which scales as 0{N~'^) in the system size N. The key step 
in the proof is to construct an adapted metric on the state space of X(t), for which 
the contraction property can be established. This requires special coordinates and 
a coupling argument, which is presented in Proposition 12.61 and Proposition 12.81 

The advantage of first establishing exponential convergence in the weak sense 
is that it allows to include very general transition kernels P (for example, non- 
absolutely continuous kernels), and does make reference to the invariant measure. 
Instead it relies on a very natural geometric property of the interaction mechanism 
of X{t). 

In a second step we assume reversibility of the constructed unique invariant 
measure, and show that the convergence occurs at an exponential rate, which 
is explicitly related to the rate of convergence in Vaserstein metric. In particu- 
lar, this shows that the spectral gap scales as 0{N~'^) in the lattice size N. The 
precise statement is given in Theorem l2.12l whose prove relies on the Kantorovich- 
Rubinstein duality property of the Vaserstein metric, see Lemma 12.111 This is 
another manifestation of the usefulness of the weak convergence in Vaserstein dis- 
tance in the study of the spectral gap for interacting particle systems. 

Section [6] contains final comments and conclusions. 

2. Analysis of a special case 

In this section we consider a special case of the class of processes defined by 
generators of the form ([T]). Namely we consider the case where the rate function 
A is constant, and the transition kernel P is state independent. In other words we 
consider a process X(t) with infinitesimal generator 

N-l 

(3) CA{x) = A ^ / P{da) [A{T,^c.x) - A{x)] 

1=1 

acting on the space of bounded observables A : M. 

As was already mentioned the process X(t) preserves the total energy. This im- 
plies that the process cannot have a unique stationary state on all of M^. However, 
we will show below that the restriction of the process to any of the invariant sets 
Se,N has a unique stationary distribution. 

The first step in this direction is to introduce more convenient coordinates on 
Se^Ni which is the purpose of the next result. 

Lemma 2.1 (x in terms of u). Let N and e be fixed. Then any x G S^^n can be 
uniquely written as 

N-l 

x = el + Ui[ei - Gi+i] 
1=1 

for some u G M^"""^, where 1 denote the vector (1,...,1). Furthermore, via this 
change of coordinates the set S^.n C is in one-to-one correspondence with the 
set 

St.N = {m G M^^"'^ : — e < Wi, < e + Ui, u^^i < e} . 
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Note that the vectors — e^+i for i ^ 1, . . . , N — 1 span the simplex Se^N, but 
they are not mutually orthogonal. However, they almost are in the sense that any 
two of them are perpendicular as soon as they correspond to two values of i, which 
differ by at least 2. 

In the following we will also need the inverse coordinate transformation, which 
expresses u in terms of x. 

Lemma 2.2 {u in terms of x). Let x € be given. Then the corresponding e 

is given by e = X]fc=i w '^^^ corresponding u is the solution to the discrete 
Poisson equation with Dirichlet boundary conditions 

Ui^i -2ui + u,+i = x^+i - Xi for i = 1, . . . , iV - 1 

where we formally set uq = = 0. More explicitly 



i . N 

^ ~ Jj]'^^'' ~ for all I < i < N - 1 

k=l " k=l k=i+l 



Ui = y (xk 



is the expression for the corresponding u G 



Proof. Clearly, x G S^^n if and only if e is given by the claimed formula. Fur- 
thermore, it follows immediately from the definition of the coordinates u, that 
Xi = e ^- Ui — Ui-i for all i, where we use the convention uq = un = 0. This im- 
plies that u must solve the discrete Poisson equation with zero Dirichlet boundary 
conditions. 

On the other hand we can sum up the expression for Xi in terms of u and obtain 
a telescoping sum, which yields 

i i 

Ui = ^(wfc - Uk-i) = ^{xk - e) 

k=l k=l 

for alH = l,...,Ar- 1. 

And since cN = J^iLi '^^ '^^^ replace e in terms of this sum, and thus obtain 
the second expression for Ui. □ 

The point of the change of coordinates from x to e and u is to separate out the 
conserved quantity e, and consider only the evolution of the nontrivial part U(t) of 
the process X(i) 

AT-l 

(4) X(i) =el+ ^ U,(i)[e.-e,+i] , 

namely the u-coordinate vector corresponding to X(t). Since e is conserved it follows 
that (J{t) is also a homogeneous Markov process (for each e separately). Using the 
results of Lemma l2.1l and Lemma l2.2l we can now derive the infinitesimal generator 
of U{t). 

Lemma 2.3 (The generator of U(<)). Let N and e be fixed. Then the process U(t) 
is a homogeneous Markov process on S^^n, whose infinitesimal generator C^.n is 
given by 

N~i „ 

C^.NAiu) = A ^ / P{da) [Aifl^u) - A(m)] , 
i=i •' 
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where 

fl^u -u=[{l-a) + a Ui+i + (2 a - 1) e - Ui] e M^^^ 
with the convention uq = ujv = 0. 

Proof. From its definition ([2]) we have Ti^a(x) = x+[a Xi+i — (1 ~ a) Xi] [e^ — e,;+i]. 
Note that [T^ agrees with Xk for all k different from i and i + 1, and [Ti^o-xji + 
[Ti.ax\i+i equals Xi + x^+i (local energy conservation). Therefore, [T^ ^u\k equals 
Uk for all k ^ i, because by Lemma [2.21 we have Ui — — e)- 

So it remains to consider \T^^u\i. Using the above two expressions for u and 
Ti^a{x) we obtain 

i i 
Ka^li - = ^{[Ti.a{x)]k - e) - ^{Xk " «) = [Ti^a{x)]i - X^ 
k=l k=l 

— a Xi-i-i — (1 — a) Xj . 

Using Lemma |2. II we can express x in terms of u as e + — where we 

used the convention uq = un = 0. Substituting this expression in the previous 
formula yields the claimed expression for — u. Furthermore, this (trivially) 

also shows the claimed expression for the infinitesimal generator of U(t). □ 

2.1. Weak convergence. Fix again the values of e and N . To study the existence 
of and rate of convergence to a stationary distribution we consider a bivariate 
Markov process (U(t), U'(t)) on S^^m x S^^N: whose infinitesimal generator 

(5) CA{u, = A ^ / P{da) [A{fl^u, fl^u') - A{u, u')] 

2—1 

for any (bounded) observable A on S^.n x S^^n- Note that this is a special Markov 
coupling of two copies of the Markov chains generated by C. 

In order to analyze the weak convergence of the process X(i) towards a stationary 
distribution we consider the Vaserstein metric on the probability measures on 5c _ at. 
This requires, however, a metric d(., .) on S^.n- We equip S^.n with the Euclidean 
metric 

N-l 

,'^2 



(6a) d{u,u'):^[Y,{u,-u[f 
which corresponds to the metric 

(6b) A{x,x')=[Y.{^{xk-A') ]'^d(^.,^') 

i=\ k=\ 

on S^^jsi- In particular, we the have following estimate on the diameter of S^^i^. 
Lemma 2.4 (Diameter of 5e^Ar). Let e and N be fixed. Then 



max d{x,x')= max d(u, u') < e N y/ N — 1 



holds. 
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Proof. By Lemma 12.11 it follows that for any u £ S^.n the inequality —ie<Ui< 
e{N -i) holds for alH = 1, . . . , iV - 1. Therefore, 



N-1 



j=l i=l 

for any two u and u', which implies the claim. 



□ 



The following Proposition 12.61 provides the first step to estimate d(U(t), U'(t)). 
A particular role will be played by the matrix 



(7) 



/ 2 





-1 








\ 





2 





-1 







-1 





2 





-1 






-1 





2 





-1 







-1 





2 





V 








-1 





2/ 



oNxN 



The spectral properties of C'-^ are provided by the following Lemma 

Lemma 2.5 (Spectrum of C'-^^^-'). If N is odd, then the eigenvalues of C^^~^^ 
are given by 



4 sin 



r TV k 



N + 1 



for k — 1 , . . 



TV- 1 



where each has multiplicity two. If N is even, then the eigenvalues of C^^ ^' are 
given by 



4 sin' 
4 sin^ 

each of multiplicity one. 



TT k 



N 

r TT /c 



N + 2 



for 
for 



k = l, 
k = l. 



N 
N 



Proof. By the definition of C*^^-* we see that the even and odd indices separate. In 
fact, it is readily seen that the action of C^^^ on the odd indexed (ui, U3, . . .) and 
the even indexed (1*2, W4, . . .) entries of u is given by the action of the matrix 



A 





-10 


\ 


-1 


2-10 






0-12 


-1 


V 


00-1 


2/ 



It is readily verified that if ^ e jjmxm^ then for k = 1, . . . ,m the vectors (sin[7r k • • • , sin[7r k 

are eigenvectors of A corresponding to the eigenvalues 



4 sin"' 



TT k 

-2(m + 1)- 



for 



fc = 1, . . . , m . 
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If N is odd, say N — 2m + l for some m > 1, then there are m odd and m even 
indexed entries in u e R^^^. Therefore, the eigenvalues of C^^™^ are given by 



4 sin 



TT k 



2(m + 1) 



for 



, , m 



where each has muhiphcity two. 

If N is even, say N — 2m + 2 for some m > 1, then there are m + 1 odd and m 
even indexed entries in m e K.''^""'^. Therefore, the eigenvalues of C'-'^™"'""'^^ are given 

by 



4 sin"' 



4 sin 



TT k 



2(to + 1) 

TT fc 



-2 (to + 2) 
where each has multiplicity one. 



for fc = 1 , . . . , m 
for k = 1, . . . ,m + 1 



□ 



Proposition 2.6 (Average contraction rate). Assume that the transition kernel P 

hen 



satisfies JP{da)a—^. Th( 

C[d{u,u'f] < -A[1-4ct|,] sin^ 



iV + 2 



d(u,M')^ 



holds for any two states u and u' , where ap denotes the variance of P. 

Remark 2.7. Since P is supported on [0, 1] and is assumed to have mean J P(da) a - 
i it follows that the variance of P satisfies 0< 1 — 4crp < 1. 

Proof of Proposition \2.6[ From the definition of the generator C and the distance 
d(., .) it follows 

JV-l 



i=l 



and 



N-l 



fc=l 



fc=i 



Making use of the explicit expression for T^^u — u provided by Lemma 

TiaU - M = [(1 - a) Uj-i + a Wi+i + (2 a - 1) e - u^] 
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the above sum simplifies to 
dm>,f;>')'-d(«,«')' 



■ [T;>-w]i-[T;>'-«']i + 2(«i-«0 

= (1 - a) - + a [u,+i - - [u, - u[] 

■ {I -a) [ui-i - + a [ui+i - + [ui - w^] 

r 1 ^ 

= (l-a)[Mi-i-u-_i]+a[ui+i-u-+i]J -[ui-u-]^ 

= (1 - Cxf [Ui-i - u'i_-^f + C? [Ui+i - U'+i]^ 

+ 2 a (1 - a) - [wj+i - - - u-]^ 



which in particular shows that the above expression depends only on the difference 

vector M — u'. 

Performing now the sum over i yields 



N-2 



N-1 



i=l 



^ [d(f fl^u'f - d(u, u'f] = (1 - a)' E - "'il' + E - "^]' 

N-2 N-1 

+ a (1 - a) ^ 2 - - - ^ k - m^]^ 



i=l 



where we made use of the convention uq = un = u'q = u'j^ = 0. 
Note now that the assumption J P{da) ct = ^ implies 

j P{da) oc^ = j P{da) {l-af =al + ^, j P{da) a{l-a) = ^-a 



and hence 
1 



N-2 



£.[d{u,u'f] = 



/ P{da){l - a)2 J2 [ui - uT + / Pida)a'^ ^ [ 

i=l 1=2 

/iV-2 
P{da)a (1 - a) ^ 2 [ui_i - [m+i - u^_^_-^] 
A O 



iV-1 



1-4ct 



2 ,-^-1 



N-2 



1+4(72 



1=1 i=2 
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It is now straightforward to verify that 



C[A{u,u'f 



-A- 



l-4crf 



-A 



1 + 4ct|, 



nT^(JV-l) 



[ui - u'^Y + [UN-I - "W-l]' 



where the matrix C*^^ was defined in ([7]) above. 

Observe that by Lemma [275l the smallest eigenvalue of C^^"'^-' equals 4 sin^[-j^q:Y] 
if N is odd, and 4 sin^[-j^5^] if N is even. Therefore, 

C[A{u, u'f] < -A [u - m']^C(^-i) [u - u'] 



< -A [1 - 4ap] sin^ 



N- 



d{u, u'y 



follows from the fact that C*^^ is a symmetric matrix, and < 1 — 4(Tp. 



□ 



Let U and U' be any two random variables on S^^n with distribution denoted by 
fi and n' , respectively. Recall that for p>l the Vaserstein-p distance is defined by 



Pp(U, U') = Ppifi,n') = inf 



r(du, du) d{u, u'y 



where the infimum is taken over all couplings F of /i and /x'. To shorten the notation 
we set /i') = pi{fi, /i') in the special case p — 1. 

Proposition 2.8 (Rate of convergence in Vaserstein-2 distance). Assume that the 
transition kernel P satisfies J P{da)a — ^. Let U{t) and U'(t) be two Markov 
chains generated by C on iSg^jv- Then for all t > 

P2(U(i), U'W) < P2(U(0), U'(0)) exp ( - i A [1 - 4 4] sin^ 



< e N VN -1 exp - - A [1 - 4 aj,] sin^ 



-N- 



N- 



Proof. Denote the distribution of the bivariate Markov process (U(t),U'(t)) with 
generator C by Tt{du,du'), and denote by fit{du) and ^^{du') the two marginals. 

Observe that the generator C of this bivariate process (U(t), U'(t)) is constructed 
in such a way that U(t) and U'(t) are Markov chains with generator C whose 
distributions are given by ptidu) and /2f{du'), respectively. 

Therefore, Tt{du,du') is a coupling of the two distributions iitidu) and p^{du') 
for all t > 0. In particular. 



P2(U(t),U'(t))2 < / Vt{du,du')A(u,u'f 

follows from the very definition of the Vaserstein distance. 
By the Markov property of the bivariate chain 

d(U(t),U'(t))2-d(U(0),U'(0))2- I £d(U(s),U'(s))2ds 

is a centered martingale. Hence for alH > 

Ed(U(t), U'(t))2 = Ed(U(0), U'(0))2 - 



E/:d(U(s),U'(s))2ds 
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Differentiating with respect to t and applying the estimate of Proposition l2 . 61 yields 



iV + 2 



^E[d(U(t),U'(t))2] < -A [1-4 4] sin^ 

Gronwall's inequality shows that 

P2(UW,U'(t)f<E[d(U(i),U'(t)f] 

<exp(-A[l-44] sin^ ^^^^ 

for any initial distribution Fq of the bivariate chain. 

Taking in the infimuni over all couplings Fq of fiQ and /ig yields 



E[d(U(t),U'(t))' 



t) E[d(U(0),U'(0))^ 



/92(U(i), U'(t))2 < exp ( - A [1 - 4r7^] sin 



N ■ 



< (iV - 1) exp - A [ 



A [1 - 4 al] sin^ 



tj p2(U(0),U'(0))2 

TT 



iV + 2 



where the second inequality is due to the estimate on the diameter of S^^n provided 
in Lemma [2.41 □ 



By definition of the metric d(., .) on 5c_7v in terms of d(., .) it follows immediately 
from Proposition 12.81 that there is at most one stationary distribution for X{t) on 
each Sc^N, and that the rate of convergence in the associated Vaserstein distance is 
the same as the rate of convergence for U (t) . 

Furthermore, by assumption the process X{t) on S^^n generated by £ is stochas- 
tically continuous. Hence the compactness of Sc^n (in the topology induced by the 
chosen metric) allows us to apply the Bogolyubov-Krylov argument to show that 
there is at least one stationary distribution. This proves the following Theorem l2.9l 



Theorem 2.9 (Ergodicity and mixing rate of X(t) on each Se^N-)- If the transition 
kernel P satisfies J P{da) a — ^, and ap < j, then there exists a unique stationary 
distribution tt^^n on Se,N- Furthermore, 



P2(X(i),7r£,Ar) < P2(X(0),7re,Ar) exp --A[l- 4 cTp] sin 
< e iV VN - 1 exp - i A [1 - 4 cr|] sin 
holds for all t, and any initial distribution ofX{0) on S^^n- 



N- 



N + 2 



Spectral gap. In order to analyse the spectrum of £ in L^^ ^ we will 
make an extra assumption on the invariant measure tTj jv- Recall that a measure /i 
is called reversible under £ if for all bounded / : S^^n x S^^n K 

(8) I fi{dx) [£f{.,x)]{x) = I t,{dx) [£f{x,.)]{x) 

holds. In particular, considering functions / of the form f{x,x') — F{x) for some 
bounded F: S^^n — ^ K shows that fi must be invariant under £. 

Furthermore, £ acts on as a bounded, self-adjoint negative semi-definite 
operator. An estimate on the size of its spectral gap is provided in Theorem 12.121 
below. Because the result of the following Lemma 12.101 will play a central role 
in the proof of Theorem 12.121 we include the details of this well-known result for 
completeness. 
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Lemma 2.10 (Auxiliary estimate on the spectrum of a self-adjoint operator). Let 

H be a real (or complex) Hilbert space and T: H ^ H a bounded, self-adjoint 
linear operator. Suppose there exists a constant < 7 and a dense subspace G <Z H 
on which for all g G G and f H there exists a constant C j\g > such that 
\{ f,T"'g) \ < Cf^gY^ for all n>l. Then the spectrum of T is contained in [—7,7]. 

Proof. The classical spectral theory of bounded self-adjoint linear operators [3] 
states that the spectrum a{T) of T is a compact interval in [— || T || . || T ||], and 
there exists a unique spectral measure E(dX) such that for any f^g£H 

1 = / EidX) , T" = / X-EidX) , (T"/,g) - / A" (i?(dA)/,g) 
Jr Jr Jr 

where E{dX) is supported on cr{T), and mf^g{dX) = {E(dX)f,g) is a finite signed 
measure on cr(T), whose total variation norm satisfies |r7i/^g|Tv < II / II II 5 II- 

Suppose that the spectrum a{T) of T is not contained in [—7,7]. Then there 
exists s > 7 such that for Ss — (—00, — s) U (s, 00) the projection E{Ss) is nonzero. 
Hence there exists a nonzero fs (z H with E{Ss)fs — fs- In particular, 

WfsW'^f raf^jAdX)= I mf^jM>^)>0, 
J<y(T) JSs 

because the support of the measure nnf^j^[dX) is contained in Sg by choice of fg. 
In particular, ruf^j^ ^ 0. 

For any g € G, and all n > we have 



Due to the assumption on G we also have that 



Since rnf^^g is a finite measure, and |^| > ^ > 1 on its support, the boundedness 
of the above expression for all n can only be satisfied if in fact ruf^^g ~ 0. 

Thus we have shown that nif^j^ 7^ 0, but rnf^^g = for all g G G. Since 
mf^,g is continuous in g (in fact linear and bounded) the denseness of G implies 
that there exists a sequence {gn)n>i C G such that — fg in and hence 
= TTif^^g^ — > TUf^j^ ^ 0. This is a contradiction to continuity. Therefore the 
assumption on s must have been wrong, so that for all s > 7 the projection E{Ss) 
must be zero. And since A S M is in the resolvent set of T if and only if there exists 
an open neighborhood S* of A such that £^(5*) = it follows that cr{T) C [—7, 7]. □ 

Lemma 2.11 (Lipschitz contraction). Let A: S^^n — >■ K. 6e a Lipschitz continuous 
function with respect to the distance d(., .), and set At{x) = E[A(X(t)) | X(t) = x\ 
for all t > and x G Se,N- Then At is Lipschitz continuous with Lipschitz constant 



Lip(^t) < Lip(^) exp ( - - A [1 - 4(Tp] sin 



iV + 2 



for all t > 0. 



Proof. By Jensen's inequality it follows immediately from the very definition of 
the Vaserstein distance that pp^{X{t),X' {t)) < (X(t), X'(t)) for all 1 < _pi < P2- 
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Therefore it follows from Propositfon 12.81 that 



pi(X(0,X'(t)) <p2(X(0),X'(0)) exp 



A [1 -Acrl] sin^ 



iV + 2 



for any joint distribution of (X(0),X'(0)) on S^^n x S^^n- 
Note that 5g^7v is compact, and hence 

sup \EA{X{t))^EA{X'{t))\ =pi(X(t),X'(t)) 

Lip(A)<l 

which is the well-know Kantorovich- Rubinstein duality theorem for the Vaserstein-1 
metric. 

Using the specific initial distribution (X(0),X'(0)) — {x,x') on Se,N x Se,N we 
obtain 



\At (x) - ^ (a;') I < Lip(A) pi (X(t) , X' (<)) 

< Lip(yl) d{x, x') exp f - A [1 - 4 al] sin^ 



1 



iV + 2 



because in this case p2(X(0), X'(0)) ~ d{x,x'). And since x,x' G S^^^m are arbitrary 
we see that At is Lipschitz continuous with the claimed estimate on its Lipschitz 
constant. □ 

Combining now the result of Lemma 12.111 with that of Lemma 12.101 we are in 
a position to estimate the spectral gap of £ acting on L^^ ^ , provided we assume 
that the stationary distribution tt^.n is reversible. In this case £ is a self-adjoint, 
bounded, negative semi-definite operator on L^^ ^ . 



Theorem 2.12 (Ll 

/ P{da) a = ^ '^'^'^ CTp < ^ 
reversible, then 



spectral gap for reversible tt^^n)- Suppose that P satisfies 
p ^ J. If the stationary distribution tt^jv of X{t) on S^^n is 



1 



a{C) C ( -oo,-^A[l-4cr|,] sin^ 



U{0} 



.N + 2\ 

where is a simple eigenvalue corresponding to the constant eigenfunction. 

Proof. By assumption C generates a self-adjoint, positive semi-definite contraction 
semigroup e*'' on L'^^^, which satisfies e*'-'l = 1. Therefore, the subspace H of 
L'^^ ^ consisting of functions perpendicular to the constant functions is invariant. 
Hence, the decomposition L^^ ^ = H Q) span{l} is invariant under e*''', and e*'-' 
may be restricted to H. 

Furthermore, it is a consequence of Lusin's theorem [17] that the set of Lips- 
chitz continuous functions on Sc.n is dense in L^^ ^. Hence the set G of Lipschitz 
continuous functions A on S^^n with J Tr^^N{dx) A{x) = is dense in H. 

By Lemma 12.41 and the mean value theorem, for any f (z H and g (z G 



\{f,9)\ < 11/11 ||5ll < 11/11 diam5,,^Lip(5) < ||/|| eNVN^Up{g) 
and hence 

1 



I ( /, e" * ^.g ) I < II / II e iV Lip(5) exp ( - - A [1 - 4 4] sin^ 

follows from Lemma [2. Ill for all n > 0. 



TV- 
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Since e*'-' is a positive operator the result of Lemma 12.101 yields 

TT 



<j{e'^\H) C (0,exp( - iA[l-4 4] sin^ 



iV + 2 



This implies 



a[C\H) = \ loga(e*^) C ( - cx), -i A [1 - 4^] sin^ 



-TV- 

which finishes the proof. □ 

Remark 2.13. From the proof of Theorem ] 2. 12\ it is clear that the abstract result 
Lemma \2.m shows that an estimate on the exponential rate of weak convergence of 
X(i) in Vaserstein-1 distance automatically yields an estimate on the spectral gap 
of C on L^, provided that the stationary distribution it is reversible. And since 
convergence in Vaserstein-1 distance can be controlled by two different approaches 
(recall the Kantorovich- Rubinstein duality theorem) we expect this general result to 
be also useful in other settings to prove estimates on spectral gaps. 

Remark 2.14. All results of this section are essentially conseguences of Proposi- 
tion \2.6\ and Lemma \2.1(A And since the statement of Proposition \2.6\ is readily 
rephrased for the embedded discrete time Markov chain with transition operator 

yAix) = Aix) + —— - CA{x) = J2 -TT^ / Pida) A{T,,^x) 

the results of this section all carry over (essentially verbatim) to the discrete time 
setting. One only has to multiply the rate of convergence ( and hence the spectral 
gap) by \ in the results for continuous time to obtain the corresponding results 
for the discrete time setting. 

3. Spectral gap in ^ for the general case 

Now we consider the general situation where the continuous-time Markov process 
X(t) is generated by the infinitesimal generator C given in ([1}. Suppose that tt^^n 
is a reversible measure for C. Then the associated Dirichlet form 

(9a) V^^n{A) = j TT^^N{dx)A{x)[-CA]{x) 

is defined for all A £ L^. ^ , and has the representation 
(9b) 

^N-l 

V^^NiA) = 2 X! / '^e,Nidx) A(xi,Xi+i) / P{xi,Xi+i,da) [A{T^^ax) - A{x)]'^ . 
i=i 

The basic idea to prove convergence rates for X{t) is to compare the spectral gap 
of its generator £ to a suitably chosen reference process of the type (jS]) considered 
in Section [21 In order to distinguish these two generators we use a superscript * 



CAix) J P\da) [A{T,^^x) - A{x)] 



1 " 



Kn{A) = - I nl^idx) ^ A* / P*{da) [A{T,^^x) - A(x)]^ 

i—1 
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to denote the invariant measure, the generator and the corresponding Dirichlet form 
of the reference process. 

Theorem 3.1 (Spectral gap for C). Fix e > and N , and let tt^jv be a reversible 
stationary distribution of C on S^.n- Suppose that there exist a constant A* > 
and a probability measure P* on [0,1] with mean J P*{da)a = ^ and variance 
CTp* < J such that the following are satisfied: 

(i) The rate function A satisfies A(xi,Xi+i) > A* for tt^^n -almost all x G Se.N, 
and all 1 < i < N - 1. 

(ii) There exists a constant j3 > such that P satisfies the minorization condition 
P{xi, Xi+i, ■) > P P*i-) for TT^^N-almost all x G Se,N, and all 1 < i < N — 1. 

(Hi) The unique (recall Theorem \2.9\) stationary distribution tt*^ of C* on S^^n 

(corresponding to A* and P* ) is reversible, 
(iv) The measures tt^^n o,nd tt* are uniformly equivalent, i.e. there exist two 

constants < < C+ < oo such that their Radon-Nikodym derivative 

satisfies < < C+ for all N. 

Then the spectrum of C in L^^ ^ satisfies 

a{C) c ( - c^, -/3 ^ A'^ 1 [1 - 4 4.] sin^ 

where is a simple eigenvalue. 

Remark 3.2. Later we will see that - apart from condition - the conditions of 
Theorem \3.1\ are fulfilled in a wide range of models of mechanical origin, interesting 
to us. Indeed, in Theorem \4.S\ we will prove a characterization of reversible mea- 
sures of a particular type. Among others, it will provide the existence of reversible 
stationary measures for a large class rate functions A and transition kernels P. This 
result, in particular, addresses conditions (pH]) and (fiv|) in the above Theorem \3.1\ 
in quite satisfactory generality. Also, in Section O we show that jul is satisfied, 
for instance, in the Gaspard- Gilbert model with three-dimensional balls. Finally, ^ 
is the consequence of our method. Nevertheless establishing hydrodynamical limit 
transition is a great challenge even under our conditions and for doing it our theo- 
rems serve as an excellent background. Finally we note that the applicability of the 
statement in its present form seems to be restricted to models where tt^^n = ""e at 
therefore the weakening of condition |iv|) would also be desirable. 

Proof. Since we assume reversibiUty the generator is self-adjoint, and hence we have 
the following variational characterization 





TT 






J 




2. 





U{0}, 



of the spectral gap 7 of £ acting on L^^ ^, where Vare^7v(A) denotes the variance 
of A with respect to ir^^isf. 
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By assumption we can compare the measures tt^.n, '^t ^^'^ P* ^ that for 
the Dirichlet form, recah (|9]), we obtam the estimate 

^JV-l 

T^e,N{A) 2 X! / ^^.N{dx) K{xi,Xi+i) I P{x^,x^+i,da)[A{Ti^ax) - A{x)Y 
i=i 

JV-l 

>I3C--J2 / <Nidx) A* / P*{da) [v4(T,,«x) - Aix)f 
i=i 

which is nothing else but V,^n{A) > /3 C~ for all Ae Ll^^. 

Furthermore, the variational characterization of the variance yields the estimate 

Var,,jv(A) = inf / [A{x) - c]^ = inf / t:*^ j,{dx) ^^4^^ [^(^) " ^1' 

cm J ceMj • Tr* j^(dx) 

< Ct inf / <^(da:) [A{x) ~ cf = Ct Vait^(A) 

for all A^L\^^. 

Combining both of the above estimates shows 

I?,,jv(A) ^^C- Vl^[A) 



Var,,^(A) - ^ Ct Var*^(A) 

for any A e ^ with Varj^jv(^) 7^ 0. In other words, the spectral gap 7 of £ 
admits the estimate 



Finally, note that the assumed bounds C^ < < imply that L^^ 

Lt* so that the above estimate can be rewritten as 

e,JV 

where 7"^ denotes the spectral gap of C* in L^* ^■ 
Now recall that by Theorem 12.121 

7* > ^A* sin^ 

which in turn shows for the spectral gap 7 of £ 

7>/3^A*i[l-4a|,.]sin2 



N + 2 



N + 2 



which finishes the proof. □ 

4. Classification of reversible product measures 

In this section we will characterize reversible product measures of X(i). It is 
worth recalling at this point that for any N fixed the sets S^^n C R+ are invariant 
for the process for any choice of e > 0. And since these are simplexes there are 
no (non-trivial) product measures ^(dx) = v(dxi) ■ • • v(dxN) supported by a single 
Se,N- However, conditioning an invariant product measure on all of to any Sc,n 
yields an invariant measure on Se,N- Therefore, we will consider product measures 
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on all of (canonical measures) instead on the ergodic components Se,N (micro- 
canonical measures). And since our main convergence result Theorem 13.11 is for 
reversible invariant measures, we consider here only reversible product measures. 

The first step in classifying all of them is provided by Lemma 14. 1[ which says 
that it suffices to consider N = 2. 

Lemma 4.1 (Reversible product measures and system size). Let v be a probability 
measure on R+. Then the product (probability) measure ^{dx) — ^{dxi) ■ ■ ■ v{dxN) 
on is reversible for X(t) (with generator ^) for some N if and only if it is 
reversible for N = 2. 

Proof Let A : X ^ M be bounded. To shorten the notation we use [CA{x, .)] 
to denote the function obtained by the action of the generator C on second variable 
of the function A{x,x'), while treating the first variable as a parameter. Further 
we use [£A{x, ■)]{x') to denote the evaluation of the function [CA{x, .)] at the point 
x' . Correspondingly, in [£^(.,a;')] the second variable is treated as a parameter. 
By definition ([T]) of the generator £ we have 

N-l 

/ ^j.{dx)[CA{.,x)]{x) ^ / iy{dxi) ■ ■ ■ iy{dxN) A{xi,Xi+i)- 

■ j P{xi, Xi+i,da) [A{Tt^aX, x) - A{x, x)] 

N-l 

= y2 iy{dxi)i/{dxi+i)A{xi,Xi+i) j P{xi,Xi+i,da)- 

■\Ai^i+i(a[xi +Xi+i],{l - a) [xi + Xi+i],Xi,Xi+i) 

Ai^i^l (^Xi , XiJ^\ , Xi , Xi^\ ) , 

where we used the short hand notation 

Ai,i+i{xi,Xi+i,x[,x'^j.{) = / v{dxi) ■ ■ ■ v{dxi-i) v{dxi+2) ■ ■ ■ v{dxN) A{x, Zi) , 

Zi — (xi , . . . , — 1 , , , Xi^2 1 • • • ^ Xj\j^ . 

Recall that reversibility means /jjn ^{dx) [CA{., x)]{x) = J^n ^i{dx) [CA{x, .)](2:), so 
that reversibility holds if and only if 

/ v{dxi)i'{dxi+i) K[xi,Xi+i) j P[xi,Xi+i,da)- 

i=l -^18+ 

• Ai^i+i{a [xi + Xi+i], (1 - a) [xi + x^+i], x^, x^+i) 

N-l 

= ^ / v{dxi)v{dxi+i) K{xi,Xi+i) / P{xi,Xi+i,da) 
i=i 

• Ai^i+i{xi,Xi+i,a[xi + Xi+i], (1 - a) [xi + x.,+i]) 



for any bounded A : x ]Ri|^ 
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In the particular case where ^(x, x') — 0(a;i, x'l) for some bounded ^: M+xM+ 
Ai^^+l{xi,x^+l,x'^,x'^_^_l) = / iy{dxi) <j){xi, Xi) = const 



for alH = 2, . . . , — 1. Hence reversibiUty requires 

iy{dxi)v{dx2) A{xi,X2) / P{xi, X2, da) (f>{a [xi + X2],xi) 



iy{dxi)iy{dx2) A^{xi,X2) / P{xi,X2,da) 4}{xi,a[xi + X2]) 



Consider now A{x,x') — a;2, x'j^, a;2) for some bounded tp: M."^ x I 

Then 

Ai^2{xi,X2,x[,X2) = ll:{xi,X2,x\,x'2) 

A2,3{x2,X:i,x'2,x'^) = / v{dxi)ll}{xi,X2,Xi,X2)=i)(x2,x'2) 



A^^^+l{x^,x^+l,x[,x[^-^) ^ / ) 1/(^X2 ) V'la^^i , 2^2 , a:^i , 2:2) = coust 

for all i = 3, . . . , — 1. Combining this with the previous special case (applied to 
4> = ip) shows that reversibility requires 



^{dxi) i'{dx2) A{xi, X2) J P{xi,X2,da)- 

■ tpia [xi + X2], (1 - a) [xi + X2],xi,X2) 
i^{dxi) i'{dx2) A{xi,X2) / P{xi,X2,da)- 



■ ij{xi,X2, a [xi + X2], (1 - a) [xi + X2]) 

for any bounded test function tp : x — > M. And since this is also sufficient for 
reversibility, it follows that reversibility of the product measure holds if and only if 
the above equality holds for all tp. 

Finally, observe that this last expression is precisely the reversibility condition 
for N = 2, which finishes the proof. □ 

The final expression in the above proof actually shows that reversibility of the 
product measure is equivalent to a slightly stronger statement than the one stated 
in Lemma WA\ Namely, because both integrands agree at (0,0) the reversibility of 
the product measure is equivalent to 



i^(dxi) iy{dx2) A{xi, X2) / P{xi,X2,da)- 
^\{(o,o)} J 

^^^^ ■ ipia[xi +X2], (1 - a) [xi +X2],xi,X2) 

= / iy{dxi) iy{dx2) A{xi,X2) / Pixi,X2,da)- 

JrI\{(o,o)} J 

■ tp{xi,X2,a [xi + X2], (1 — a) [xi + X2]) 
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This simplification is relevant, because so far we have not ruled out yet the possi- 
bility of V having an atom at 0. 

For the further analysis we will need to assume that the rate function A and the 
transition kernel P are of the form 



k[xi, Xi+i) = As(a;, + x^+i) ( — ) 



P(xi , Xi+i , da) = P I , da 

\ X'l -\- Xi^\ 

Here the subscripts s and r stand for "sum" and "ratio", respectively. Note that 
^ ^ makes sense everywhere on M^^ \ {(0,0)}, and by the above this set is all 
that we need to consider. In Section [5] below we will see that the representation 
(|lip naturally occurs in models originating from mechanical systems. 

We have already shown that in order to classify reversible product measures for 
arbitrary TV it is enough to study the case N = 2. This, however, is still not a 
completely straightforward problem, since the answer might depend on the rate 
functions As and A^. The next Corollarv 14. 21 simplifies this issue. 



Corollary 4.2 (Reversible product measures and rate functions). If As{ri) > for 
all < T] < oo, then the process has a reversible stationary product measure /i (as 
in Lemma \4.1^ if and only if 

Xi 

L\{(o,o)} ^xi+X2^ J va;i-|-a;2 



y{dxi)y{dx2)K{ ] P( ""^ ^da\ 

\Xl + X2'' J ^Xl 



ri\xi + X2,a, ■ 



Xi + X2 

v{dxi) v[d 

L\{(0.0)} 



v{dxi)u{dx2)Ar{ \ P( ,da)- 

\Xx+X2^ J \Xi + X2 ) 

■ rjfxi + X2, — ^ — ,a) 



holds for all bounded r/: R+ \ {0} x [0, 1]^ R. 

Proof. By (fTO)) reversibility of the product measure is equivalent to 



/ v{dxi) v{dx2) As{xi + X2) 

JR^\{(0,0)} ^Xi+X2y J \xi+X2 I 

• '(/'(a [a;i + X2] , (1 - a) {x-y + 0:2] ,xi,X2) 

v(dx\)v(dx2) As(x\^ X2) aA ^- — J / P\ ^- — , da J • 

|\{(0,0)} \X\+X2' J \Xi+X2 / 

• '\\}{xi,X2^ a [xi + ^2], (1 - a) [xi + 2:2]) 

for any (non-negative) test function V: \ {(0, 0)} x _j. Qn R\ \ {(0, 0)} 
the change of coordinates {xi,X2) ^ {xi +X2, ^x-Ixq, ) one-to-one, hence any such 
function i\) may be recast as 



20 



ALEXANDER GRIGO, KONSTANTIN KHANIN, AND DOMOKOS SZASZ 



for some function 77: (M+ x [0, 1])^ — > M. Therefore reversibility holds if and only if 
/ v{dxi) v{dx2) -^^sixi + X2) 

JR2\/mn^\ Vxi +X9/ I Vxi + X9 / 



«^\{(0,0)} 

■?](xi +X2,a, — — — ) 

V XT + X2 / 



Xl + X2 

v' " 



L\{(o,o)} 



(dx\) v(dx2) As(xi + ^2) 

\X\ + X2' J \X\^' X2 ' 

■ i](xi + X2, — — — ,a) 



holds for ah r/: R+ \ {0} x [0, 1]^ R. 

And since xi + X2 > our assumption on implies that A^ is strictly positive, 
and hence may as well be combined with 77, because 77 is arbitrary. This finishes 
the proof. □ 



With Lemma 14. 1[ and Corollary 14.21 we are now in a position to classify all re- 
versible product measures, which is the content of the following Theorem l4.3l This 
classification relies on a well-known fact 1161 about Gamma distributions. Namely, 
suppose that Xi and X2 are two non-constant, independent, positive random vari- 
ables. Then Xi + X2 and x^X2 independent if and only if Xi and X2 are 
independent, identically Gamma-distributed random variables. 

In the theorem below we use the following notation: For e > we denote by 
(5(e, da) the Dirac measure concentrated at e. 

Theorem 4.3 (Reversible product measures). Suppose that the Markov chain on 
[0,1] with transition kernel P{l3,da) has a unique invariant distribution, say p{.). 
Let N be arbitrary, and suppose further that A^ is such that As{a) > for all 
a > 0, and Ar(/3) > for all < /3 < 1. Then the product measure fi{dx) = 
i>{dxi) ■ ■ ■ v{dxN) is reversible for X(t) if and only if p is a reversible measure for 
the Markov chain generated by P , and either of the following two holds: 

(1) There exists e > and d > such that 

dxi 



v{dxi 



r(i) 



p{dp) = dp [/? (1 - /?)] K{fi) \ 

where Z is the normalizing constant. 
(2) There exists e > such that i'{dxi) — S{e,dxi), p{da) — (5(i,da), and 
P{\,da) = 5{\,da). 

Proof. From Lemma [4.11 we know that it suffices to consider N = 2, and Corol- 
lary |4?2] shows - as it is also clear intuitively - that the choice of A^ is irrelevant, 
and that we only need to consider the process on \ {(0, 0)}. 

Using the change of variables a = xi + X2 , 13 — — on \ {(0,0)} we 
can disintegrate the product measure i'{dxi) h'{dx2) such that for any (bounded) 
77 : — K we have 

/ iy{dxi) i'{dx2)r]{xi,X2) ^ / i^s{da) / i^r{<^,dl3)ri{l3 a,{l - fi)a) 

jRl\{iO,0)} "'R+\{0} "'[0,1] 
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where i's{.) is the distribution of the sum xi + X2 and j/r(cr, .) is the conditional 
distribution of the ratio ^^^^^^ given that xi + X2 = cr. 

Using this notation the condition for the reversibihty of the product measure of 
Corohary 132] takes on the form 



vs{da) J Vr{(T,dP)K{P) J P{P,da)r,{a,P,a) . 
This holds if and only if for -almost every a 

f Vr{a,d(i)K{P) I P{p,da)n{a,(i) 

(12) 

Vr{a,dp)K{P) / P{P,da)f]{P,a) 



for aU bounded fy: [0, 1] x [0, 1] ^ K. 

Suppose now that the product measure is reversible. The special choice ■q{a,(3) — 
ipio) for some i/j: [0, 1] — R thus shows that 

M<y,d/3) AriP) J F(/3,da)V(a) = J Ma.dp) kr{p) ^{fi) 

for all In other words, the (not normalized) non-negative measure 1^^(17, dP) Ar{/3) 
must be invariant under P. And since by assumption P has a unique invariant 
distribution, denote it by p, it thus follows that 

i i/, (a, d/3) A, (/3) = p(d/3) , Z = / i^, (a, d/3) A, (/3) 



for z/j -almost every a, where Z > by assumption on A^. 

In particular, this means that the conditional distribution Vr{a^.) of the ratio 
given that a = xi-\-X2 actually is the same for all values of a. In other words 



X1+X2 



the sum xi + X2 and the ratio ^^^^^ are independent. And since also xi and X2 
are independent (by assumption) we conclude [161 that either is a point mass, i.e. 
v^dxi) = S{e,dxi) for some e > 0, or is a Gamma distribution, i.e. 

, , dxi rxi] 2-1 e~~ 
e lei r(f ) 

for some e > and d > 

In the former case it follows 

i^sida) = 6i2e,da) , p(d/?) = z/,(d/3) = S{^,dl3) 

for i^s, i^r- Hence the reversibility condition (fT2|) becomes J P{^,da)ri{a, = 
f P{^, da) 7](i. a) for all 77, which is equivalent to 



P{l,da)^s{l,da) 



Similarly, in the latter case 



da 



e 



a 



Insider) ^ — -, MdP)^dp[/3{l-p)] 



4-1 r(d) 



r(d) ' "^'^^ r(|)2 
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follows for Vs, i^r, where we used the well-known properties of Gamma and Beta 
distributions. The reversibility condition (|12p becomes 

for all rj, and 

p{dl3) ^ dp [/3 (1 - ^ Kifi) \ 

must be the expression for the unique stationary distribution of P. 

This proves that if the product measure is reversible, then v is either constant, or 
a Gamma distribution, and the transition kernel must have the claimed stationary 
distribution. 

To finish the proof it remains to consider the converse. Assume either of the 
two possible distributions for v and also the corresponding assumption on P. For 
these special distributions it is well-known (and easily verified) that the sum and 
the ratio are independent with the distributions as considered above. Hence we see 
that the reversibility condition ([T2l) is indeed satisfied. □ 

We finish the discussion of reversible product measures with the following re- 
mark. Note that in the statement of Theorem 14.31 there is an assumption on the 
kernel P that appears in the generator of the process X(t). By Lemma [4.11 and 
Corollary 14.21 it suffices to consider the reversibility of the product measure for 
N — 2 and constant rates. Upon restricting this process to any of the invariant sets 
Se^2^ the embedded discrete time Markov chain is precisely the Markov chain on 
[0, 1] with transition kernel P(/3, da). Therefore, the assumption in Theorem l4.3fo n 
the kernel P is equivalent to saying that for = 2 and constant rates the process 
X(f) has a unique stationary distribution on any of the 5^,2- 

A sufficient condition for this uniqueness is to assume that P satisfies a uniform 
minorization condition, i.e. there exists a constant 7 > and a probability measure 
P* on [0, 1] with / P*{da) a = i and o-|,. < \ such that P{f3, .) > -fP*{.) for aU 
f3 £ [0, 1]. Recall that this is also the type of condition P assumed in Theorem 13. II 



5. Example: the rarely interacting billiard lattice 

Here we illustrate the use of Theorem 13.11 and Theorem 14.31 with the billiard 
lattice model studied in |10| . which was one of the main motivations for our work 
presented in this paper. It was argued in |10j that in the limit of rare collisions the 
dynamics of a billiard lattice becomes a Markov jump process. The notation used 
in [10| differs from ours in that we separate the rate of interaction As A^ from the 
transition probability kernel P, whereas in [TU] the product A^ A^ P is denoted by 
W, and the rate function A^ A,, is denoted by 1^. Changing equations (61) and (62) 
of [To] to our notation yields 



mM^3 WMeS, A.(/3).^i±£Iii=£), Us) 

da 2 i-K/?V(l-/3) ' 6 ^/3v(l-/3) ^ ' 
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for the transition kernel P and the rate functions and A^, respectively. The 
symbol V denotes the maximum and A denotes the minimum. 

Since the underlying mechanical model has a three-dimensional configuration 
space for each of the constituent particles it follows that 



d = 3 , i/{dxi) 



da \<J 



dxi [xi 2 e s 



eve \/7r 



e L e J z TT 
g Y 

p{da) — da i/a (1 — a) — Aj.{a) — 

TT Z 

should be used in Theorem 14.31 In fact, this measure is the (canonical) Gibbs 
measure for the mechanical model, and thus must also be invariant for the limiting 
jump process. 

Another general property that the jump process inherits from the underlying 
mechanical model is that the rate function A is proportional to the square root 
of the total energy of the two sites that interact, i.e. As(ct) — ^fa as mentioned 
above. This cannot be avoided when taking scaling limits of interacting mechanical 
models, because it corresponds to the kinematic scaling relation between the energy 
and the velocity (and hence the time scale). However, a rate function without a 
uniform lower bound leads to serious technical complications at various levels. See, 
for example, ^ for how this issue seriously complicates the rigorous derivation of 
the weak interaction limit of a related deterministic model. 

Furthermore, such a rate function also complicates the rigorous analysis of the 
rate of convergence to equilibrium. In fact, in order to apply the results established 
in this paper we need to have A^ bounded from below. Recall that we showed in 
Lemma l4 . II that the above reversible product measure is also a reversible stationary 
distribution for the process generated by the infinitesimal generator corresponding 
to any other function Ag (while keeping A^ and P unchanged). And since A^ 
represents the kinematic scaling, and not the nature of the energy exchange during 
an interaction, we will change the model of |10j in that we change Aj. In fact, our 
next Lemma is most useful exactly under the setup of the aforementioned work. 

Lemma 5.1. // A^ is replaced by any non-negative continuous function, which is 
bounded away from zero, then the following hold for any N and e. 

dxi fxT 2 e 



(1) The product measure fi{dx) = ^{dxi) ■ ■ ■ v{dxN) with v{dxi) — y — 

is the unique reversible product measure for X(t). 

(2) On every S^^n there exists a unique stationary distribution 'Ke,N ■ This mea- 
sure is obtained by conditioning fi{dx). 

(3) The spectrum cr{C) of the generator C acting on satisfies 



a(£)c (-oo,-Csin2 [_^]] u{0} 



for some constant C, which may depend on the choice of As- 
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Proof. The explicit expressions for the transition kernel and the rate functions allow 
us to show 

^ P(/3, da) ^ ^/2^ ^^^"jl-f) ^ V/3 A(l - /?) A y/g A(l - a) 

da 4 V/^V(l-/3) 4 V/^ (1 - /?) 

for all a,l3 e [0, 1]. Hence P(/3, da) = p(da) P(a, d/S), i.e. 

p(d/3) / P(/?,da)V(a,/3) = / / P(/3, da) a) 



holds for ah i/j: [0,1]^ ^ M. 
Furthermore, the estimate 



-1 . / a A(1 — q) 

Pjp.da) _ Stt 1^ V ;3a(i-/3) 1 
j.,(da) ~ 16 i+/?V(l-/?) ^a(l-a) 



> 



Stt 
16 
Stt 
16 



i + /3V(l-/3) [i + /3v(l-/3)] x//?A(l-/3) 



TT 
4 

provides the minorization condition P(/3, da) > |-i^r(da)- In particular, this implies 
that the Markov chain on [0, 1] with transition kernel P has a unique invariant 
measure. 

Therefore, it follows then from Theorem 14.31 that /i((ix) is a reversible product 
measure, and must be unique. 

Observe that by Theorem l4.3l the infinitesimal generator corresponding to P*{da) 
Vrida) and a constant rate function also has iTe.N as a stationary reversible distri- 
bution. Combining this with the above minorization condition for P and < 

Ar(/3) < -^^^ we see that under the assumption that As is bounded from below all 
assumptions of Theorem 13. II are satisfied. □ 



The significance of the result of Lemma 15.11 is that it provides an interesting 
model that fits the conditions of Theorem 13.11 We would like to point out that 
previous to [10] the analogous two-dimensional billiard network was studied in 
[9]. However, in this case the uniform mixing condition ([n]) of Theorem 13.11 fails to 
hold, which is why we restricted our attention in the above to the three-dimensional 
setting. 

6. Conclusion 

The authors of [10] suggested a two step strategy for deriving the heat equation 
from a mechanical model. Motivated by that we have introduced in this work a class 
of stochastic models with the aim to implement the second step of their strategy: 
the derivation of the heat equation from a mesoscopic stochastic model. 

At present it is widely understood that a necessary ingredient to rigorously 
establish the hydrodynamical limit is a sharp bound on the dependence of the 
spectral gap of the generator on the system size. Such a bound is one of the main 
results of the present paper. 

Besides the importance of this bound for the hydrodynamic limit, an additional 
value of our result is that for systems with continuous state space such bounds are 
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hard and scarce, e.g. [12l [1]. As in those works, our method requires to assume 
that the rates are bounded away from zero. 

In more detail: according to our main result the spectral gap of the infinitesimal 
generator of the process scales as 0(iV^^) in terms of the systems size N. This 
is precisely the kind of scaling which allows for a diffusive scaling limit, and hence 
the study of the hydrodynamic limit. However, we do not study the hydrodynamic 
limit in this paper, because it requires different ideas and techniques, and results 
on the spectral gap are of interest in their own right. 

To keep the model as close to the mechanical ones as possible (cf. Section [5] 
and [10]) it is desirable to remove the assumption on existence of a uniform lower 
bound of the rate function. Numerical simulations suggest that the 0{N^'^) scaling 
of the spectral gap remains true also for rate functions that can approach zero. In 
particular for the square root of the total energy of the interacting pair, which is 
the rate function that appears in mechanical models due to the kinematic scaling 
of the velocity with the energy. However, we do not have a rigorous proof of such 
a statement available at present. 



References 

[1] E. A. Carlen, M. C. Carvalho, and M. Loss. Determination of the spectral gap for Kac's 
master equation and related stochastic evolution. Acta Math., 191(l):l-54, 2003. 

[2] Eric A. Carlen, Jeffrey S. Geronimo, and Michael Loss. Determination of the spectral gap 
in the Kac model for physical momentum and energy-conserving collisions. SI AM J. Math. 
Anal., 40(l):327-364, 2008. 

[3] John B. Conway. A course in functional analysis, volume 96 of Graduate Texts in Mathe- 
matics. Springer- Verlag, New York, second edition, 1990. 

[4] D. Dolgopyat and C. Liverani. Energy transfer in a fast-slow Hamiltonian system. ArXiv 
e-prints, October 2010. 

[5] J. -P. Eckmann and L.-S. Young. Nonequilibrium energy profiles for a class of 1-D models. 
Comm. Math. Phys., 262(l):237-267, 2006. 

[6] J. Fritz. On the hydrodynamic limit of a one-dimensional Ginzburg-Landau lattice model. 
The a priori bounds. J. Statist. Phys., 47(3-4):551-572, 1987. 

[7] J. Fritz. On the hydrodynamic limit of a scalar Ginzburg-Landau lattice model: the resolvent 
approach. In Hydrodynamic behavior and interacting particle systems (Minneapolis, Minn., 
1986), volume 9 of IMA Vol Math. Appl, pages 75-97. Springer, New York, 1987. 

[8] P. Gaspard and T. Gilbert. Heat conduction and fourier's law in a class of many particle 
dispersing billiards. New Journal of Physics, 10(10):30, 2008. 

[9] P. Gaspard and T. Gilbert. On the derivation of fouriers law in stochastic energy exchange 
systems. Journal of Statistical Mechanics: Theory and Experiment, 2008(11):20, 2008. 
[10] P. Gaspard and T. Gilbert. Heat transport in stochastic energy exchange models of lo- 
cally confined hard spheres. Journal of Statistical Mechanics: Theory and Experiment, 
2009(08) :24, 2009. 

[11] M. Z. Guo, G. C. Papanicolaou, and S. R. S. Varadhan. Nonlinear diffusion limit for a system 
with nearest neighbor interactions. Comm. Math. Phys., 118(l):31-59, 1988. 

[12] Elise Janvresse. Spectral gap for Kac's model of Boltzmann equation. Ann. Probab., 
29(l):288-304, 2001. 

[13] M. Kac. Foundations of kinetic theory. In Proceedings of the Third Berkeley Symposium on 
Mathematical Statistics and Probability, 1954-1955, vol. Ill, pages 171-197, Berkeley and 
Los Angeles, 1956. University of California Press. 

[14] C. Kipnis, C. Marchioro, and E. Presutti. Heat flow in an exactly solvable model. J. Statist. 
Phys., 27(l):65-74, 1982. 

[15] Claude Kipnis and Claudio Landim. Scaling limits of interacting particle systems, volume 
320 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Math- 
ematical Sciences]. Springer- Verlag, Berlin, 1999. 



26 



ALEXANDER GRIGO, KONSTANTIN KHANIN, AND DOMOKOS SZASZ 



[16] Eugene Lukacs. A characterization of the gamma distribution. Ann. Math. Statist., 26:319- 
324, 1955. 

[17] Walter Rudin. Real and complex analysis. McGraw-Hill Book Co., New York, third edition, 
1987. 

[18] Horng-Tzer Yau. Logarithmic Sobolev inequahty for lattice gases with mixing conditions. 

Comm. Math. Phys., 181(2):367-408, 1996. 
[19] Horng-Tzer Yau. Logarithmic Sobolev inequality for generalized simple exclusion processes. 

Probab. Theory Related Fields, 109(4):507-538, 1997. 



Department of Mathematics, University of Toronto, Toronto, ON, Canada. 
E-mail address: alexander . grigoSutoronto . ca 

Department of Mathematics, University of Toronto, Toronto, ON, Canada. 
E-mail address: khaninOmath . utoronto . ca 

Budapest University of Technology and Economics, Mathematical Institute, Egry 
J. u. 1, 1111 Budapest, Hungary. 
E-mail address: szaszSmath.bme.hu 



